Sampling Large Tables with Constraints
نویسندگان
چکیده
We describe a new sequential sampling method for constrained multi-way tables, with foundations in linear programming and sequential normal sampling. The method builds on techniques from other sequential algorithms in a way that scales well and can handle more challenging data sets. We apply the new algorithm to data to demonstrate its efficiency.
منابع مشابه
Conditional Inference on Tables with Structural Zeros
We describe a sequential importance sampling approach to making conditional inferences on two-way zero-one and contingency tables with fixed marginal sums and a given set of structural zeros. Our method enables us to approximate closely the null distributions of various test statistics about these tables, as well as to obtain an accurate estimate of the total number of tables satisfying the con...
متن کاملSequential Importance Sampling for Multiway Tables
We describe an algorithm for the sequential sampling of entries in multiway contingency tables with given constraints. The algorithm can be used for computations in exact conditional inference. To justify the algorithm, a theory relates sampling values at each step to properties of the associated toric ideal using computational commutative algebra. In particular, the property of interval cell c...
متن کاملMinimal basis for connected Markov chain over 3× 3×K contingency tables with fixed two-dimensional marginals
We consider connected Markov chain for sampling 3 × 3 × K contingency tables having fixed two-dimensional marginal totals. Such sampling arises in performing various tests of the hypothesis of no three-factor interactions. Markov chain algorithm is a valuable tool for evaluating p values, especially for sparse data sets where large-sample theory does not work well. For constructing a connected ...
متن کاملElectricity Procurement for Large Consumers with Second Order Stochastic Dominance Constraints
This paper presents a decision making approach for mid-term scheduling of large industrial consumers based on the recently introduced class of Stochastic Dominance (SD)- constrained stochastic programming. In this study, the electricity price in the pool as well as the rate of availability (unavailability) of the generating unit (forced outage rate) is considered as uncertain parameters. Th...
متن کاملExact P - values in Incomplete Multi - way Tables ∗
I develop a new Markov chain algorithm for sampling from sets of multi-way contingency tables defined by an arbitrary set of fixed marginals and by lower and upper bounds constraints on cell counts. My procedure is called the Bounds Sampling Algorithm (BSA) and it relies on the existence of a method to calculate lower and upper bounds for cell entries. BSA accommodates any pattern of structural...
متن کامل